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Abstract 

We study the complexity of deciding the equality of infinite objects specified 
by systems of equations, and of infinite objects specified by A-terms. For 
equational specifications there are several natural notions of equality: equality 
in all models, equality of the sets of solutions, and equality of normal forms 
for productive specifications. For A-terms we investigate Bohm-tree equality 
and various notions of observational equality. We pinpoint the complexity of 
each of these notions in the arithmetical or analytical hierarchy. 

We show that the complexity of deciding equality in all models subsumes 
the entire analytical hierarchy. This holds already for the most simple infinite 
objects, viz. streams over {0, 1}, and stands in sharp contrast to the low arith- 
metical if§-completeness of equality of equationally specified streams derived 
in [T7] employing a different notion of equality. 

1 Introduction 



In the last two decades interest has grown towards infinite data, as witnessed by 
the application of type theory to infinite objects [5], as well as the emergence of 
coalgebraic techniques for infinite data types like streams |19j . infinitary term 
rewriting and infinitary lambda calculus [24] . In functional programming, the 
use of infinite data structures dates back to 1976, see [14l HTj . 

We are concerned with the complexity of deciding the equality of infinite 
objects specified by systems of equations, and infinite objects specified by A- 
terms. The equational specification of coinductive objects is common practice in 
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coalgebra, term rewriting and functional programming. Consider the following 
example from [17] : 

zeros = : zeros ones = 1 : ones 

blink = 0:1: blink zip(a; : <j,t) = x : zip(r, a) 

This is an equational specification of three infinite lists of bits, and a binary 
function over infinite lists0 Then, a typical question is whether the following 
equality holds: 

zip(zeros, ones) = blink (2) 

The answer depends on the semantics we choose to interpret the equality; for 
example @ is not valid in the hidden models considered in [T7] ; for more details 
we refer to Section [2] In order to answer such a question, we first need to settle 
on the precise semantics of equality for equational specifications; the candidates 
we consider in this paper are 

I. Equality in all models. 

II. Equality of the set of solutions. 

For A-terms we are not concerned with equality in the sense of convertibility 
(which is known to be Il^-complete, see (T|). Instead, we are interested in 
behavioral equivalence of A-terms in all contexts, because this corresponds to 
the interchangeability of expressions in purely functional languages. It is also 
closely related to referential transparency, and the notion of Bohm trees as 
values of expressions including those without normal form. Thus we consider 
the following equivalences for A-terms: 

III. Observational equivalences. 

IV. Bohm-tree equality. 

The 'right' choice of equivalence depends on the intended application. The 
classic semantics mentioned in items U and HI] above, are defined by model- 
theoretic means. From a algebraic perspective U and |TT] are the most natural 
semantics to consider for equational reasoning. On the other hand, IIIII and 
IIV1 are defined by means of evaluation, i.e., rewriting. In functional program- 
ming the latter are of foremost importance, because these take (lazy) evaluation 
strategies into account. From an evaluation perspective, two terms are equal if 
they have the same observable behavior, independent of the context they are in. 
In contrast to the model-theoretic notions, this equality is invariant under the 
exchange of meaningless subterms, that is, subterms which cannot be evaluated 
to a (weak) head normal form. 

Another candidate for the semantics of equality is 

1 In Haskell there is zip :: [a] — > [b] — > [(a, &)], but we prefer to use 'zip' for the interleaving 
of lists, as defined by the equation in JT}, since that is what a zipper does: it interleaves rows 
of teeth. 
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V. Equality of normal forms for productive specifications. 

A rewrite specification is productive [331 IS] if the terms under consideration can 
be fully evaluated, that is, (outermost-fair) rewriting yields a (possibly infinite) 
constructor normal form in the limit. In such a setting, equality of the normal 
forms is a suitable semantics for the equivalence of terms. Deciding the equality 
of productive specifications has been shown to be a n^-complete problem in [T3] ; 
this semantics is not considered here. 

We now briefly describe the concepts U tilVI 

Equality in models (J] and HI]) . The semantics [T] (equality in all models) 
is useful when the objects under consideration are specified in the same spec- 
ification. This semantics interprets the objects simultaneously in each model 
satisfying the specification. This allows us to compare objects that depend on 
a common unknown, an underspecified object; see Q below for an illustrating 
example. If the objects under consideration are fully specified, that is, have 
unique solutions, then semantics U coincides with semantics UT1 

In contrast to[H semantics HI1 is more suitable for comparing objects specified 
by different specifications, as we explain below. The objects are compared via 
the set of their solutions (in their respective specifications). This semantics is 
well-known from equations over real (or complex) numbers, where two equations, 
like 

(x - l) 2 - 1 = and x 2 - 2x = , 

are equivalent if they have the same solutions for x, here {0, 2}. 

A E-algebra A consists of a carrier set A (the domain of A) and an interpre- 
tation [•] of the symbols E occurring in the equational specification as functions 
over A. Then A is called a model of an equational specification E, which we 
denote by A \= E, if all equations of E respect the interpretation; that is, for 
every equation of E both sides have the same interpretation for every assign- 
ment of the variables. As the domain we will typically choose (a subset of) the 
final coalgebra [2IJ describing the class of objects we are specifying. The final 
coalgebra ensures that the model is continuous, that is, if we have a converging 
sequence of terms t\,t2, ■ ■ ■ with limit t u , then the sequence of interpretations 
pi], P2], . . . converges towards [tj. For example, in a specification like 

ones = I : ones ones' = I : ones' (3) 

the symbols ones and ones' are guaranteed to have the same interpretation. 
Continuity is crucial to conclude the validity of equations such as ones = ones' 
which are not satisfied in non-continuous models like the initial algebra of the 
specification. 

Let E be a specification of M and N. Then M is considered equal to N 
with respect to semantics U if every model of E is also a model of M = N: 
MA. A \= E A \= M = N . This notion is especially of interest if M 
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and N depend on a common unknown and consequently have to be interpreted 
simultaneously in the same model. For example in 

M = z\p(X, X) z\p(x : <t,t) — x : zip(r, a) 
N = dup(X) dup(x : a) = x : x : dup(o-) 

the streams M and N are both specified in terms of an unspecified stream X. 
Whatever interpretation X has, M and N are equal, and so they are equal in 
the sense of semantics HJ 

On the other hand, semantics U has the effect that an underspecified con- 
stant is not equivalent to its renamed copy. This is illustrated by the following 
specification: 

M = : tail(M) N = : tail (AT) (5) 

Here M and N are not equal in every model; for example, let \M\ = 0:0:... 
and [TV] =0:1:1:.... Nevertheless, M and N are equal in the sense that 
they exhibit the same behaviors. That is, they have the same set of solutions: 
every stream starting with a zero is a solution for M as well as for N . Thus, 
M and A^ are equal with respect to the semantics [TTJ This paves the way for 
comparing objects M and N that are given by separate specifications Em and 
En, respectively. Note that it is not always suitable to apply semantics fl] to 
the union Em U En even if the specifications have disjoint signatures (using 
renaming), see further Remark [3] 

Two objects M and N are equal with respect to semantics [TT] if the set of 
solutions of M in E M coincides with the set of solutions of N in E N : { {M\ A 
A |= Em } = { [Nj A | A \= En } ■ Here the set of solutions of a constant X in 
a specification Ex is the set of interpretations of X in all models of Ex ■ 

Observational equivalence ()IIII and HVj) . In purely functional languages 
based on the A-calculus [T] , the evaluation of expressions is free of side effects. 
As a consequence, an expression (or subexpression) can always be replaced by 
its normal form, the so-called value of the expression. This principle is known as 
referential transparency. This also implies that expressions can be substituted 
for each other if they have the same normal form. 

For specifications of coinductive objects, such as infinite lists (called streams) 
or infinite trees, the value typically is an infinite term. For example in ones = 
1 : ones, the term ones has as value (or infinite normal form) the infinite term 
1:1:1:.... However, it is not always guaranteed that a term can be fully 
evaluated. During the evaluation to the (possibly infinite) normal form, we may 
encounter subterms that cannot be evaluated because these subterms do not 
have a head normal form. In A-calculus, such terms are known as meaningless 
terms. For example, consider: 

natsx(n) = n : g(0) : natsx(n + 1) g(n) = g(n) 

natsx'(n) = n : g(n) : natsx'(n + 1) 
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Here g(n) is meaningless for every n. Consequently, natsx(O) evaluates to a 
stream in which every second element is meaningless, and therefore, undefined. 
An infinite value containing undefined parts can be represented by means of 
Bohm trees [I] introduced in 1975 by Corrado Bohm. In particular, the Bohm 
tree of natsx(O) is: 0:±:1:_L:2:_L:3:_L:4:_L:..., where _L is a special 
symbol representing an undefined element. 

In A-calculus (or orthogonal higher-order rewriting), terms with equal Bohm 
trees can be exchanged (for each other) without changing the meaning of the 
whole expression. In the specification above, natsx(O) and natsx'(O) have the 
same Bohm tree, and hence are interchangeable. In contrast, from the model- 
theoretic perspective natsx(O) and natsx'(O) are different. In every model of 
natsx(O) all elements at odd indexes coincide, whereas natsx'(O) admits models 
that assign different interpretations to these elements. From a rewriting as well 
as functional programming perspective, these differences are irrelevant as they 
concern undefined subterms. 

There are several notions of infinite values, depending on what terms are con- 
sidered meaningless, including Bohm trees, Levy-Longo trees, Berarducci trees, 
?/-B6hm trees, ry^Bohm trees; see further [6]. The terms Xx.xx and Xx.x(Xz.xz), 
for instance, have distinct Bohm trees, but we may want to consider the terms 
behavior 'ally ', or observationally equivalent as they are 77-convertible. There are 
several natural concepts of observational equivalence for A-calculus, where terms 
are considered equivalent if they yield the same observations in every context. 
To that end, we consider three forms of observations: normal forms (nf), head 
normal forms (hnf), and weak head normal forms (whnf). A head normal form 
is a A-term of the form Xxi. . . . Xx n .yN\ . . . N m with n, m > 0. A weak head 
normal form is an hnf or an abstraction, i.e., a whnf is a term of the form 
xMi . . . M m or Xx.M. Each of the observations gives rise to an equivalence 
=nf, =hnf or = w hnf, defined by 

M = nf N iff (VC. C[M] has a nf iff C[N] has a nf ) 
M = hnf N iff (VC. C[M] has a hnf iff C[N] has a hnf) 
M = whnf N iff (VC. C[M] has a whnf iff C[N] has a whnf) 

In fact, the equivalence =„/ corresponds to ry-Bohm trees, and —hnf to ry^Bohm 
trees. For more details we refer to [6], where it is argued that = w hnf corresponds 
to evaluation strategies used by lazy functional languages. If two expressions 
behave the same in every context, then no functional program can distinguish 
them. 

Contribution. Wc characterize for each of the semantics Hl- liVI the complexity 
of deciding the equality of terms. For U and [TT| wc will focus on cquational 
specifications of bitstreams, and for IIIII and IIVI on behavioral equivalences of 
A-terms and Bohm tree equality. 

Each of these equivalences is undecidable, therefore we characterize their 
complexity by means of the arithmetical and analytical hierarchies, see Fig- 
ure U The arithmetical hierarchy classifies the complexity of a problem P by 
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the minimum number of quantifier alternations in first-order formulas that char- 
acterize P. The analytical hierarchy extends this classification to second-order 
arithmetic, then counting the alternations of set quantifiers. 

(A) It turns out that the complexities of deciding the equality in all models 
as well as the equality of the set of solutions subsume the entire arithmetical 
and analytical hierarchy when the domain of the models is the set of all streams, 
so-called full models, see Theorems [5] and |H The idea of the proof is as follows. 
We translate formulas of the analytical hierarchy into stream specifications by 
representing V set quantifiers by equations with variables. This simulates a 
quantification over all streams as the models are full, and the equations have to 
hold for all assignments of the variables. The 3 set quantifiers are eliminated 
in favor of Skolem functions (here stream functions) . The interpretation of the 
functions is determined by the model, and the question whether there exists a 
model corresponds to an existential quantification over all Skolem functions. 

® & © If we admit models whose domain does not contain all streams, 
then the complexity of deciding equality drops to the level 11} of the analytical 
hierarchy for semantics HI and to II 3, for UH see Theorems [TJ and [TTJ The reason 
is that equations with variables no longer have to hold for all streams, but only 
for the streams that exist in the model. By the Lowenheim- Skolem theorem we 
obtain that if there exists a model, then there exists a countable model: from an 
uncountable model we construct a countable one, by taking the finitely many 
streams "of interest" and closing them under all functions in the model. Thus, 
it suffices to quantify over countable models for which one single set quantifier 
is enough. 

The aforementioned results already hold for bitstreams, one of the simplest 
coinductive objects, and thereby can serve as a lower bound on the hardness of 

® 
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Figure 1: Arithmetical (bottom) and analytical hierarchy (top). 
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the equality problem for other coinductive objects. We also study the behavioral 
semantics from [17] . We find that if behavioral equivalence = is required to be 
a congruence, like for example in [2J, then the complexity of deciding behavioral 
equivalence is catapulted out of the arithmetical hierarchy, to the level 11} of 
the analytical hierarchy, see Theorem II 21 Likewise so for the behavioural equiv- 
alence for specifications of streams of natural numbers, relaxing the congruence 
requirement, see Theorem 1131 

@ For the equivalences on A-terms, we show that deciding the Bohm 
tree and Levy-Longo tree equality, as well as the observational equivalences 
=n/ 5 =hnf an d —whnf are n^-complete problems, see Theorem [T5l (It is clear 
that when an object is given by a rewrite system, like the A-calculus, then the 
complexity resides in the arithmetical hierarchy, since it suffices to quantify over 
a number steps to evaluate parts of the object.) 

(e) Finally, we consider the complexity of unique solutions. A term s 
has a unique solution within a specification E if there exists models of E, and 
in all models of E, s has the same interpretation. The problem of deciding 
unique solvability in all full models again subsumes the analytical hierarchy, see 
Theorems [6l [7] and [H When also considering the non-full models, we find that 
the problem is U\- and £}-hard, but is strictly contained in A3>, see TheoremsEJ 
[3] and |U 

Outline. We first discuss related work. We formally introduce bitstream spec- 
ifications and stream models in Section [3J and Turing machines with oracles in 
Section |4j We recall the basic complexity-related notions in Section [5] We 
use these concepts in Section [6] to derive the complexity results for the model- 
theoretic notions. In Section [7] we consider a different notion of models, namely 
the behavioural semantics as in 117) . Finally, we investigate the observational 
equivalences of A-terms in Section [5] 

2 Related Work 

The complexity of the equality of streams specified by systems of equations has 
been investigated in the ICFP paper [171 Corollary 1]; we cite: Proving equality 
on streams defined equationally is a HQ-complete problem. This result is based 
on a behavioral notion of stream models [16] . We briefly summarize the main 
characteristics of these models: 

(i) Every stream a 6 {0, 1}" can have multiple representatives in the model 
(known as confusion). 

(ii) For every equation £ = r it is required that the interpretations \£\ and 
[r] are behaviorally equivalent, denoted by =, that is, equality under all 
[head] ([tail]"(D)) experiments. In particular, it is not required that {£] = 

W- 

(iii) Behavioral equivalence = is not required to be a congruence. 
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Behavioral models have a wide range of applications, for example for modeling 
computations with hidden states, or capturing certain forms of nondctcrminism. 
For these applications it is often intended that the semantics is not preserved 
under equational reasoning. For example, consider the following specification 
from |T7] 

tail(push(<r)) = a , 

specifying a function push that prefixes an element to the argument stream, 
while leaving unspecified which element. In the behavioral models we obtain a 
restricted form of nondeterminism |25j . for example, the following is not behav- 
iorally satisfied: 

push(tail(push(cr))) = push(er) , (6) 

although derivable by pure equational reasoning. For a nondcterministic oper- 
ation, it is of course desirable that (|6]) does not hold. 

However, for function definitions employing pattern matching, behavioral 
models sometimes yield unexpected results; consider: 

ones = 1 : ones f (x : a) = a (7) 

Now, there are models that satisfy the specification (J7J, but not ((5J): 

f (ones) = ones (8) 

In these models we have that [ones] ^ [1 : ones] and, at the same time, that 
[ones] cannot be constructed by the stream constructor [:J, that is, [ones] ^ 
[:](x, s) for all x G {0,1} and s G As- Consequently, the interpretation 
[f]([ones]) can be arbitrary. 

Remark 1. We construct a behavioral model for specification [7] in which 
f (ones) = ones is not satisfied: 

As = {o} U {w | w G {0, 1}"} (the domain of the model) 
[head](o) = 1 [head](0w) = [head] (Ho) = 1 

[tail](o) = o [tail](0w) = w [tail](lw) = w 

[:] (0, o) = (l") [:] (0, w) = Ow [:] (1, w) = ltu 
[:](l,o) = l w [ones]=o 

[fi(o) = (r m(ow) = w m(iw) = w 

This model illustrates that the requirements of |17j do not ensure that every 
element of the stream domain Ag can be constructed by the stream constructor 
[:]. For example, the element o represents the stream of ones, but o ^ [:](a, b) 
for all a G {0, 1} and b G A s . In general, [M] and [head(M) : tail(M)] need not 
be the same element of the domain, although they are behaviorally equivalent. 
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Thus, behavioral reasoning is typically not sound for behavioral models, and 
therefore the corresponding specifications are usually referred to as behavioral 
specifications. In this paper we are interested in specifications where equational 
reasoning is sound. 

Remark 2. We construct a behavioral model (A, [•]) in the sense of [T7] where 
specification (flj is behaviorally satisfied but the goal equation zip(zeros, ones) = 
blink is not. The model thereby forms a counterexample to [TTJ Example 2]. 
We define the domain by Ab = {0,1} and 

A s = {z w | w £ {0, 1}*} U {o w | w S {0, 1}*} U {0, 1} W 

Here z £ and o e are alternative representations of 0" and 1" , respectively, and z w 
and o w have an additional finite prefix w £ {0, 1}*. We define the interpretations 
[•] for every a £ {0,1}, a £ {0,1}", w,v £ {0,1}* and x, y £ A s . For [head] 
and [tail] we define: 

[head](z £ )=0 [head](o e ) = 1 [head](acr) = a 

[tail](* e ) = z £ [tail](o e ) = o £ [tail] (aa) = a 

[head](z aM ,) = a [head](o atu ) = a 

[tail] (z aw ) = z w [tail] (o aw ) = o w 

We define the interpretation [:] of the stream constructor by: 

[:}(a,z w ) = z aw H(a,o w ) = o aw [:](a, a) = aa 

Note that the elements z £ and o £ cannot be constructed by [:]. 
We interpret [zeros], [ones] and [blink] as follows: 

[zeros] = z £ [ones] = o £ [blink] = (01) w 

We define an auxiliary function 1X1 that (similar to zip) interleaves the elements 
of finite or infinite words; for u u u 2 £ {0, 1}^ = {0, 1}* U {0, 1} W , let am XI 
U2 = a(u2 X u%) and e XI w 2 = We now define the interpretation [zip] of the 
symbol zip as follows: 



\z\p\{z w ,o v ) 
[zip](o w ,^) 



(w X w)0" for \w\ = \v\ 

(w0 u ) X (v l u ) otherwise 

(w X w)0 w for \w\ = \v\ 

(id 1 w ) X (v 0") otherwise 



and in all other cases, we define [zip] (x, y) — emb(x) X emb(y) where emb(z n 
w0 u , emb(o w ) = wl u , and emb(a) = a. 

We check that the specification [T] is behaviorally satisfied: 

[zeros] = z £ = z = [0 : zeros] 
[ones] = o £ = o\ = [1 : ones] 
[blink] = (01)" = 01(01)" = [0:1: blink] 
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Observe that z £ and zq (and likewise o £ and oi) are behaviorally equivalent. For 
the zip equation we distinguish the following cases: 



(i) If [cr] = z w , [t] =o„, we have: 
[zip(x : cr, r)] 

\x : zip(r, cr)] 



(xw ixi v)Q" if |xw| = \v\ 

(xw 0") M (u 1") otherwise 



x(v IX w)0" if |i>| = |w| + 1 

x((v 1") M (w 0")) otherwise 



The equality [zip(x : cr, r)] = [a; : zip(r, cr)] follows by the definition of IX 
together with \xw\ = \w\ + 1. 



(ii) The case [cr] = o v , [r] = z lu is analogous to (i) 

(hi) If [cr], [r] € {0, 1} W , we have: 

[zip(x: ex, t)] = (x[ct])M[t] 

= x([t] M [<t]) = [x:zip(r,cr)] 

(iv) If [cr] = z w , [r] G{0,1} W , then: 

[z\p(x : <t,t)J = (xw w )tX[r] 

= x([t] IX (u> 0")) = [x : zip(r, cr) 



(v) The case [cr] = o^,, [r] € {0, 1}" is analogous to (iv) 



(vi) The case [cr] £ {0, 1}", [r] = z w is analogous to (iv) 



(vii) The case [cr] e {0, 1}", [t] = o w is analogous to (iv) 

(viii) If [cr] = z w , [r] = z v , then: 

[zip(x : cr, r)] = (xw 0") M (w W ) 

= x((w 0") IX (w 0")) = \x : zip(r,cr) 



(ix) The case [cr] = o w , [r] = o v is analogous to (viii) 



Hence (A, [■]} behaviorally satisfies Specification [T] However: 

[zip(zeros,ones)] = [zip] ([zeros], [ones]) 
= [z\p](z e ,o s ) = 0" 



whereas 



[blink] - (01)" 



Consequently, the equation zip(zeros, ones) = blink is not behaviorally satisfied 
in this model. 
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The counterexample in Remark [5] employs the fact that the behavioral mod- 
els of [17 do not require that every stream can be constructed by the (in- 
terpretation of the) stream constructor [:|. As a consequence, the equation 
zip(x : a, t) = x : zip(Y, a) does not fully define [zipj; it defines [zip] (a, r) only 
for those arguments a that can be constructed by [:] . 

The example illustrates that the behavioral models of [T7] do not go along 
with function definitions using pattern matching. To fully define [zip], we 
can specify it using the stream destructors: head(zip(er, r)) — head(er), and 
tail(zip(er, t)) = zip(r, tail(cr)). This change of the specification format resolves 
the problem. 

Alternatively, keeping the specification format, we can adapt the notion 
of models. To reestablish soundness of equational reasoning one can (i) ex- 
clude confusion or (ii) require that = is a congruence. Note that the com- 
mon models of streams are free of confusion: final coalgebras [20 , one-sided 
infinite words A u , and the function space N — >• A. In hidden algebras [T5] . 
confusion is often allowed but its negative effects are prevented by restrict- 
ing to behavioral models [2,, in which behavioral equivalence is a congruence: 
s = t => /(..., s, .. .) = /(. . . ,t, . . .). Then equational reasoning is sound 
with respect to behavioral equality, and for a specification like ones = 1 : ones, 
ones' = 1 : ones', the equality g(ones) = g(ones') holds behaviorally. 

Our results show that when = is required to be a congruence (or confusion is 
eliminated) , then the complexity of the equality of bitstreams that are specified 
equationally jumps from the low level Ili] of the arithmetical hierarchy to the 
level 11} of the analytical hierarchy, thereby exceeding the arithmetical hierar- 
chy. Moreover, we show that even for behavioral specifications with confusion 
(as in [T7]). equality of streams of natural numbers is n}-complete. Conse- 
quently, the results of [T7] are valid only for bitstreams in combination with the 
behavioral equality discussed above. For general behavioral specifications (not 
the special case of stream specifications), the IlJ-completeness has been shown 

Term rewriting systems are closely related to equational specifications. The 
complexity of deciding various standard properties of term rewriting systems, 
such as productivity, termination and confluence (Church-Rosser), has been 
investigated in [8l [10] . 

3 Bitstream Specifications 

We will focus mainly on streams, one-sided infinite sequences of symbols, the 
prime example of coinductive structures. There are various ways of introducing 
streams: as functions N — > A mapping an index n to the ro-th element of the 
stream, as final coalgebras over the functor X H> A x X, using coinductive 
types pj], or observational models [2]. All these definitions are equivalent in 
the sense that the resulting coalgebras are isomorphic. 

For the model-theoretic semantics of equality, we will focus on specifica- 
tions of bitstreams, streams over the alphabet {0, 1}. Due to their simplicity, 
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bitstreams can be embedded in almost every non-trivial coinductive structure. 
Specifications of bitstreams are inherently sorted, with a sort B for bits, and a 
sort S for bitstreams. To this end, we introduce sorted terms. Let S be a set of 
sorts; an S -sorted set C is a family of sets {C s } se s- Let C and D be 5-sorted 
sets. Then an 5-sorted function (or map) from C to D is a function / : C D 
such that f(C s ) C D s for all s £ S, that is, a function that respects the sorts. 

An (S-sorted signature E is a set of symbols / G E, each having a type 
(si, . . . , s n , s) 6 <S™ +1 , denoted by / :: si X . . . X s n — ► s, where ra is the arity of 
/. Let A 1 be an 5-sorted set of variables. The 5-sorted set of terms Ter(E,X) 
is inductively defined by: 

- X s C Ter(E, A") s for every s £ 5, and 

- /(*!,..., i n ) S Ter(E,A% if / :: s x X . . . X s„ ->■ s, / € E, and i x € 
Ter(E,^) Sl ,...,i„ G Ter(E,Af) s „. 

An cS-sorted equation £ = r consists of terms £, r G Ter(E, Af) s x Ter(E, A") s for 
some s E S. 

Definition 1. A bitstream signature E is an 5-sorted signature with S — {B, S} 
such that 0, 1, : G E where 0, 1 :: B are called bits, and the infix symbol ':' of 
type B x S — > S is the stream constructor. An equational bitstream specification 
over E is a finite set i? of equations over E. 

From now on we let S — {B, S}. 

Definition 2. A stream algebra A — (A, [•]) consists of: 

(i) an 5-sorted domain A; A B = {0, 1} and ^ A s C {0, 1} N , 

(ii) for every / :: Si x ... x s n — > s £ E an interpretation [/] : A S1 x . . . j4 Sji — > A s , 

(iii) : G E with [:](x,cr) = a; : u, 

(iv) 0, 1 G E with [0] = and [1] = 1. 



The clause (iv) of the definition is optional; in fact, the results in this paper 
are independent of its presence. We have included it since the models where 
[0] = [1] are trivial, in the sense that then all bitstreams are equal. 

Definition 3. Let A = (A, [•]} be a stream algebra. Moreover, let a : X — > A be 
a variable assignment. As usual, the interpretation of terms |-]^ : Tfer(E, X) — > 
A is defined inductively by: 

H» = *{X) lf(h, . . . , t B )]£ = in(Ma, • • • » [*»la) 

Then A is called a (stream) model of if [£j Q = fr] Q for every I — r <E E and 
a : A" — » A. We write [-] a for whenever A is clear from the context. For 
ground terms t G Ter(E,0), we have [i] Q = [i]^ for all assignments a, /3; we 
then write [i] for short. 
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Thus, we interpret function symbols as functions over bits and bitstreams 
as imposed by their sort. In particular, terms of type S are interpreted as 
bitstreams. In contrast to [T7], our setup does not allow for confusion in the 
models. Recall that confusion means that the models can contain multiple 
representatives for the same stream. 

Definition 4. We say that a model A — (A, [•]) is full if its domain contains 
all bitstreams, A s = {0, 1} N . 



4 Turing Machines as Equational Specifications 

We now define a set of standard equations (for bitstream specifications) that 
will be used throughout this paper: 



zeros = : zeros ones = 1 : ones 
zip x (r) = t 
zip 2 (x : ti,t 2 ) = x : zip 2 (r 2 ,Ti) 
zip„(ri,...,r n ) =zip 2 (Ti,zip n _ 1 (T 2 ,...,T„)) (n > 2) t 



(9) 



To give an example, 

zip a ((r,T,p) = ff(0) : r(0) : a(l) : p(Q) : <r(2) : 

T(l):<7(3):p(l):<7(4):r(2):... 

writing cr(i) for the i'th entry of the stream a. 

We emphasize that all systems of equations in this paper are finite. To that 
end, we extend the specifications only by those equations from Q that are 
needed by the specification, that is, the equations zip n (. . .) = ... for which a 
symbol zip m with n < m occurs in the specification. 

Lemma 1. In every stream model A — {A, [■]) of a specification including the 
equations from Q we have: 

(i) [zerosl = 0" and [ones] = 1", 

(ii) for all o~i, . . . ,o~k € As, k > 2 and n €E N: 
[zipil(o-i) = o-i, 
[zip fc I(o-i,...,cr fc )(2ra) =ai(n) 

i zi Pfc](c r l,---,cr fc )(2n + l) = [zip fe _ 1 ](cr 2 , . . . ,a k )(n) 

A Turing machine M is a quadruple (Q,T,q ,5} consisting of a finite set of 
states Q, an initial state qo E Q, & finite alphabet T containing a designated 
blank symbol □, and a partial transition function S:QxT^QxTx {L, R}. 

For convenience, we restrict V to the alphabet T — {0, 1} where is the 
blank symbol □, and we denote Turing machines by triples (Q,qo,S). As input 
for the Turing machines we typically use a unary number representation 11 ... 1 
(n-times) to encode the number n. Of course, another encoding is possible, as 
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long as the encoding is computable, and the Turing machine is able to detect 
the end of the input (since is part of the input alphabet and it is also the 
blank symbol). 

We define a translation of Turing machines to equational specifications of 
bitstream functions, based on the standard translation to term rewriting systems 
from [53]. However, we represent the tape using streams instead of finite lists, 
and have one instead of four rules for 'extending' the tape. In particular, the 
equation for extending the tape is the equation for zeros from ©. The terms 
of the shape q{o~, r) represent configurations of the Turing machine, where the 
stream r contains the tape content below and right of the head, and a the tape 
content left of the head. Notably, the head of the machine stands on the first 
symbol of r. 

Definition 5. Let M = (Q,qo,S) be a Turing machine. We define the specifi- 
cation Em to consist of the following equations: 

q(x, b:y)= q'(b' : x, y) for every S(q, b) = (q' , b', R) 

q(a : x,b : y) — q'(x, a : b' : y) for every S(q, b) = (</, b', L) 

and for halting configurations additionally: 

q{x, b : y) = b whenever 5(q, b) undefined 

with the signature £ = {0, 1, :} U Q with types q :: S x S — >• B for every symbol 
q £ Q, and 0, 1 :: B and ':' of type B x S — > S . Moreover, we use Rm to denote 
the term rewriting system obtained from Em by orienting all equations from left 
to right. 

Apart from the additional rule for termination, the translation Rm is stan- 
dard, and the rewrite rules model the transition relation of Turing machines in 
one-to-one fashion. So we take the liberty to define input of tuples (m, . . . , nk) £ 
N fe and oracles directly on the term representations. We pass fc-tuples (m, . . . , rife) £ 
N fe of natural numbers as input to a Turing machine by choosing the follow- 
ing start configuration go( zeros i z 'iPk+i(ki n ii ■ ■ ■ > n k)) where n stands for (1 : 
)™ zeros. The particular encoding of tuples is not crucial, but zip fc+1 (fc, m, . . . , rife) 
is for equational specifications more convenient than the Godel encoding. 

We obtain machines with oracles £i , . . . , £ m C N by writing the oracles ele- 
mentwise interleaved on the tape left of the head: 

Notation 1. For n 6 N we use n to abbreviate (1 :)" : zeros. For (Cfi, we 
let £ denote the stream X{(0) : X^(l) : X{(2) : • • • where X(, is the characteristic 
function of £. We write a short for a\ , . . . , and a for a%, . . . , Ofc if k is clear 
from the context. 

For a term rewriting system R, we write for a rewrite step with respect 
to R, and -+* R is the reflexive-transitive closure of—>R. 
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Definition 6. Let M = (Q,qo,6) be a Turing machine. Then for stream terms 
£i> • • • >£m :: S an d ni, ■ ■ ■ j Tik S, we define 

M(£i,...,£ m ; rii,..., rife) := 

?o(zip m (£i,...,£ m ), ziPfe+i(fc,«i, rife)) 

Definition 7. A Turing machine M = (Q,qo,5) halts (with output b) on in- 
puts ni, . . . , rife € N with oracles £1, . . . , £ m C N if there is a rewrite sequence 
M(£i, . . . , £ m ; ni, rife) — >■„ 6, where b G {0,1}. Here £ is short for the 

stream x?(0) : Xf (1) : Xc(2) : • ■ • where x? is the characteristic function of £. 

Note that the initial term is infinite due to the oracles, nevertheless we 
consider only finite reduction sequences. Due to the rules for zip n and zeros, 
there are infinite rewrite sequences even if the Turing machine halts. However, 
i?M is orthogonal and therefore outermost-fair rewriting (or lazy evaluation) is 
normalizing, that is, computes the (unique) normal form b G {0, 1} if it exists. 

Definition 8. A k-ary predicate P with m oracles is a relation P C p(N)™ 1 x N k . 
Then P is called decidable if there is a Turing machine M such that for all 
£ G p(N) m and ft G N k : M halts on input ft with oracles £, and the output is 1 
if and only if P(£, ft). 

In correspondence with Definition [5] we define for £i, . . . ,£ TO , ni, . . . ,rife G 
{0,1}", [M](£i, . . . ,£ m ;ni, . . . ,n k ) as shorthand for [g ]([zip m ](£i, . . . , £,„), |[zip fc+1 ](|[fc], ni, n k )). 
Then for the models of Turing machine specifications we have: 

Lemma 2. Let P C p(N) m xN fe fee decidable, and M = (Q,qo,5) the correspond- 
ing Turing machine. Then in every stream model A = (A, [■]) of a specification 
including the equations from Q and Em we have for every £ G p(N) m and 
n G N fe : (£,ri) G P if and only if [M](£l, . . . ,f m ;ni, n fc ) = 1. 

Proof. P is decidable, hence M(£i, . . . ,£ m ;ni, . . . , rife) has a nf in {0, 1}, and 
the normal form is 1 if and only if (£, ft) G P. □ 

5 Levels of Undecidability 

We briefly introduce complexity related notions that are relevant for this paper: 
promise problems, reducibility, hardness and completeness, and the arithmeti- 
cal and the analytical hierarchy. For more details, we refer to the standard 
textbooks [22lfT8] , 

Definition 9. Let A C P G N. The promise (membership) problem for A with 
promise P is the question of deciding on the input of n G P whether n G A. For 
the case P = N, we speak of the membership problem for A. 

We identify the membership problem for A with the set A itself, and the 
promise problem for A with promise P with the pair (A,P), also denoted by 
A\ P . 
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Definition 10. Let A,B,P,Q C N. Then A\p can be (many-one) reduced to 
B\q, denoted A < B, if there exists a partial recursive function / : N — N such 
that P C domam(f), f(P) C Q, and Vn6P.nei« /(n) G B. 

Definition 11. Let B,QCMand?C p(N) x p(N). Then B\ Q is called 7>- 
hard if every ^4|p G V can be reduced to B\q. Moreover, B\q is V- complete if 
additionally £?|q can be reduced to some A\p G T 5 . 

We stress that Definition [TT] does not require that a P-complete promise 
problem _B|q is member of V itself. This allows for classifying promise problem 
using the usual arithmetic and analytical hierarchy (for membership problems). 

Lemma 3. If A\p can be reduced to B\q and A\p is P-hard, then B is "P-hard. 

We use ((•)) to denote the well-known Godel encoding of finite lists of numbers 
as elements of N: ((ni, . . . , rife)) := p" 1+1 ■ . .. ■ p^ k+1 , where p\ < P2 < ■ ■ ■ < Pk 
are the first k prime numbers. 

We define the arithmetical and analytical hierarchies: 

Definition 12. Let Eq := LTg := A{j be the collection of recursive sets of natural 
numbers (the decidable problems). Then for n > 1, we define: 

- E° consists of sets {n\3x£ N. {{x, n)) G B} with B € n°_ 1? 

- LT° consists of sets {n \ Vx G N. ({x, n)) G B} with B G S°_ x , 

-A" := s°nn°. 

The arithmetical hierarchy consists of the classes 11° , E° and A° for neN. 

For example, the membership a £ A for every set A G II!] can be defined 
by a formula of the form \/x\. 3x2- VX3. P(a, x\, X2, X3) where P is a decidable 
predicate. 

The analytical hierarchy extends this classification of sets to formulas of the 
language of second-order arithmetic, that is, with set (or equivalently function) 
quantifiers. The following definition makes use of a result from recursion theory, 
see [18j . stating that if there is at least one set quantifier, then two number 
quantifiers suffice (for functions quantifiers, one number quantifier suffices). 

Definition 13. Let Eq := LTg := Aj = UneN^ be the set of all arithmetic 
predicates. A set A C N is in Ll^ for n > if there is a decidable predicate P 
with m oracles such that for all a G N: 

aeA V£ x . 3£ 2 - •••3fm- Vx x . 3x 2 . P(£i,...,£ n ,a,xi,x 2 ) 
aeA V£i. 3£ 2 — V£ m . 3xi. Va; 2 . P(£i, a, xt,x 2 ) 

for n even, and n odd, respectively. Here, £1 , . . . , £ m C N, the corresponding 
quantifiers are set quantifiers, and Xi,X2 £N with number quantifiers. Then A 
is in E^ , if the condition holds with all V and 3 quantifiers swapped. Finally, 

Ai = ui n Ei . 
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6 Equality in Models 



In this section we study the complexity of different model-theoretic semantics 
of equivalence of bitstream specifications. Based on the notion of models for 
bitstream specifications from Section [3J we first formalize the equivalences that 
we consider. 

For all of the following model-theoretic equivalences, we have the choice 
whether or not we require the models to be full, that is, their domain contains 
all bitstreams. For example, we can consider the equality of terms in all models 
or in all full models: 

Definition 14. Let E be a bitstream specification over E, and s,( G Ter(E, X) 
with s,t :: S. Then s and t are said to be 

- equal in all models of E if 

A |= E implies A \= s = t for all stream algebras A , 

- equal in all full models of E if 

A\= E implies A\= s = t for all full stream algebras A . 

The set of solutions of a term s in a specification E is the set of interpreta- 
tions [s] of s in all models satisfying E: 

Definition 15. Let £ be a bitstream specification over E, and s G Ter(E, 0) 
with s :: S. Then the set of 

- solutions of s in E with respect to all models is 

14e = {M a \A\=e}, 

- solutions of s in E with respect to all full models is 

MKfun^lM" 4 \ A Ml, A^E}, 

Here it suffices to consider only ground terms s G Ter(E,0). For terms 
t G Ter(E, X) with variables, the set of solutions can be defined as {t} E = 
{Ma I A h E, a : X -> A }. However, then [tj E = [sJ E if s is the ground term 
obtained from t by interpreting the variables in t as fresh constants (formally, 
this amounts to an extension of the signature) . 

Definition 16. Let E s and E t be bitstream specifications over E s and E t , 
respectively. Let s G Ter(E s , 0) and t G Ter(E f , 0). Then s and t have 

- equal solutions over all models if [s] B = [£] Bt , 

- equal solutions over all full models if [s]^ full = lt] Et full . 

Definition 17. Let E be bitstream specifications over E, and s G Ter(E,0). 
Then s is said to have 
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- a unique solution over all models if [s]^ | = 1, 

- a unique solution over all full models if | [s] # f u n I = 1> 

- a solution over all models if |[s] E | > 1, 

- a solution over all full models if | [s] E full > 1 , 

- at most one solution over all models if |[s] E | < 1, 

- at most one solution over all full models if |[s] B full | < 1. 

6.1 Auxiliary Definitions 

First, we define a few (systems of) equations that are repeatedly used throughout 
this section. The following function is zer os that maps zeros to ones and every 
other bitstreams to zeros: 

is zeros (zeros) = ones is zerO s(0 : a) = is zeros (er) 
IS Z eros (1 : er) = zeros 

This function does exactly what its name suggests; it checks whether the argu- 
ment is the stream of zeros. We use the bit or the stream zeros for false, and 
1 and ones for true. 

We focus on specifications of bitstreams, and encode streams of natural 
numbers as bitstreams via the sequence of run-length of ones. For instance, the 
stream 3 : 1 : : 2 : . . . is encoded as 1:1:1:0:1:0:0:1:1:0:.... We 
then define functions uhd and utl that are the unary counterpart for head and 
tail on streams of natural numbers: 

uhd(0 : er) = zeros utl(0 : er) = a 

uhd(l : er) = 1 : uhd(cr) utl(l : a) = utl(cr) 

For instance, we have 

uhd(l :1:1:0:1:0:0:1:1:...) = 1:1:1: zeros 
utl(l :1:1:0:1:0:0:1:1:...) = 1:0:0:1:1:... 

The following lemma summarizes these properties: 

Lemma 4. In every stream model A — {A, [■]) of a specification including the 
equations from (|10j) and pip we have: 

(i) [is Z eros](0 W ) = 1", 

[is Z eros]H = 0" f° r every we4 s \{0 u }, 

(ii) luhd](l n 0io) = l n w for every w € A s , 
|uhd](l w ) = 1", 

(Hi) [ut\}(l n 0w) = w for every w € As. 
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Note that all interpretations are uniquely denned, apart from the combina- 
tion [utl](l w ) which can be any stream depending on the model. To avoid this 
case, we need means to ensure that a certain bitstream is a valid encoding of a 
stream of natural numbers, that is, the stream contains infinitely many zeros: 



natstr(ones) = zeros natstr(0 : a) = 1 : natstr(er) 
natstr(l : a) = natstr(er) 



(12) 



Then an equation natstr(X) = ones guarantees that [XJ represents a stream of 
natural numbers: 

Lemma 5. In every stream model A = (A, [•]} of a specification including 
the equations from (|12[) we have: [natstr] (w) — l u if and only if w contains 
infinitely many zeros. 

Proof. The equations on the right 'walk' over the stream, deleting l's and con- 
verting O's to l's. If the stream contains infinitely many O's, then an infinite 
stream of l's will be produced. However, if some tail of the stream contains 
only l's then the equation on the left ensures that the interpretation is unequal 
to 1". □ 

Definition 18. Let M = (Q,qo,S) be a Turing machine. Then the canonical 
model A — (A, [•]} for the union of the specifications Em, ©, (fT0|) . (fTTj) and 
(fT2"|) consists of the domain As — {0, 1} N with interpretations [•] as given in 
Lemmas S] and O extended by 

(i) [utl](l") = 1", 

(ii) for every £ m , ni, . . . , n k C N: 
I<Z](£)") = 1 whenever [<?](£, n) — >* 1, and 
[qj((,,n) = otherwise. 

Lemma 6. The canonical model is a model of the union of the equational spec- 
ifications R M , ©, HOD, dJ and ([T2"]). 

Proof. The rewrite system Rm is orthogonal, consequently we have finitary con- 
fluence and infinitary unique normal forms |24j . Hence, we can employ a normal 
forms semantics for [qj (where we map terms without normal forms to 0). For 
the remaining equations, it is easy to see that the chosen semantics forms a 
model. □ 

6.2 Equality in all Models 

For the complexity of equality in all models we obtain: 

Theorem 1. The following problem is complete: 
Input: Bitstream specification E, terms s,t :: S. 

Question: Are s and t equal in all models of E? 
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Proof. The well-foundedness problem for decidable binary relations is known 
to be nj-complete, that is, the problem of deciding on the input of a decidable 
binary predicate MCNxN (given in the form of a Turing machine) , whether M 
is well-founded. We reduce this problem to an equality problem. Let MCNxN 
be a decidable predicate, and M = (Q, go, 6) the corresponding Turing machine. 
We dehne the following specification E: 

S = is ze ros(run(l, X)) natstr(Jf) = ones 

run(0, a) = ones *(<r) 

run(l, a) = : run(M(zeros; uhd(cr), uhd(utl(<r))), utl(cr)) 

together with the equations from Em and (|10|) . (JTTJ) and ([12]). We prove 
that: E \= S = zeros if and only if M is well-founded. 

For '=>' let M be non-well-founded, and no M rii M n,2 M . . . be an infinite 
chain. We construct a E-algebra A = (A, [•]) such that A \= E but not A \= 
S = zeros. We define A as an extension of the canonical model (Definition [T5]). 
The values of [$(tr)l and [utl(er)] are determined by the canonical model, and 
together with the equations for run we obtain for every stream £ G {0,1}": 
[run](0,0 = 1", and [run](l,0 = : [run] ([*(£)], [utl](0). Hence, there is a 
unique interpretation [run] that results in a model for the equations of run. We 
define = 1™- 1™*+ 2 . . .. and we let n = 1" (T. Then for i G N we have 

[run](l, Kl ) = 0: [run]([$]( Kl ),^+i) 
= : [run]([M](nj,7t i+ i),K i+ i) = : [run] (1, Ki+1 ) 

since we have that [uhd](«j) = rij and [utl](«j) = Hj+i for all j G N by 
Lemma |U Thus, |run](l,K ) = ~0". Let [X] = k and [S] = l w . Then 
[natstr]([X]) = [ones] byLemmaEl and [S] = [is zer0 s] ([run] (1, [X])) by LemmaH 
We have constructed a model, where [S] = 1", and, hence, E Y= S = zeros. 

For ( <=' let M be well-founded. Let A be a E-algebra such that A \= E. We 
show that [S] = 0". Since [natstr]([X]) = [ones], [X] contains infinitely many 
zeros by Lemma M Thus, [X] = l n ° l™ 1 l™ 2 . . . for some n , n x , n 2 , . . . G N. 
Let n t = l n * 1"*+! l n -+ 2 ... for i G N. Then 

[run](l,Ki) = 0: [run]([M](n„ n l+1 ), Ki+1 ) 

= f[run](l,K i+ i) if [M](^,n m ) = l 

[ [run](0,K i+ i) = 1" if [M](rH,n i+ i) = 



Hence, [run](l, [X]) = 0" if and only if [M}(rH, n i+1 ) = 1 for all i G N. However, 
this would contradict well-foundedness of M. As a consequence, we obtain that 
[run](l,[X]) ^ 0" and [S] = is zero s([run](l, [X])) = 0" by Lemma g] This 
concludes the IlJ-hardness proof. 

To show nj-membership, we resort to the Lowenheim-Skolem theorem. It 
states that if a formula of first-order predicate logic has an uncountable model, 
then it also has a countable model. Here, we employ that the domain A$ can 
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be encoded as an arbitrary set with functions [head] :: As — > {0, 1} and [tail] :: 
Ag — > As together with a first-order predicate logic formula that excludes 
confusion, that is, elements a, b G A s with [head] [tail] "(a) = [head] [tail]" (6) for 
all n £ N are required to be equal, that is, a = b. Likewise, the interpretations 
of the symbols in E can be translated to first-order predicates, and validity of 
the equations to first-order formulas. As a consequence, A \= E A [s] ^ [t] can 
be expressed as first-order formula, and if it has a model, then also a countable 
one. Hence, it suffices in MA. A \= E =>■ [s] = [<] to quantify over countable 
models. For this purpose of quantifying over countable models, a set quantifier 
\/A C N suffices. This proves n}-membership. □ 

The following three results are obtained by slight adaptations of the proof 
of Theorem [TJ In the proof of Theorem [TJ we have E \= S = zeros if and only 
if S has a unique solution over all models of E. As a consequence, we obtain 
the following results concerning (unique) solvability: 

Theorem 2. The following problem is H\-complete: 
Input: Bitstream specification E, term s. 

Question: Does s have at most one solution over all models of E? 

Proof. The nj-hardness follows from the proof of Theorem [TJ as S has < 1 
solutions if and only if M is well-founded. 

The membership in 11} uses that it suffices to consider countable models as 
in the proof of Theorem [TJ Then the formula characterizes the property having 
at most one solution: MAi. MA 2 . {Ai \= E) A (A 2 \= E) =>- \s\ Al = \s\ M . The 
two V set quantifiers can be merged into one, and the properties A \= E, and 
l s J Al — l s J A2 are arithmetic. Hence, the property is in n}. □ 

Theorem 3. The following problem is H\- complete: 
Input: A bitstream specification E, a term s. 

Question: Has s a solution over all models of E? 

Proof. The £}-hardness follows from a tiny adaptation of the proof of Theo- 
rem [TJ We replace the equation S = is zero s(run(l, X)) by the equations S = 
run(l,X) and S = zeros. Then every model where [run(l,X)] ^ 0^ is ruled out, 
and hence, the specification has a model, and S a solution, if and only if M is 
not well-founded. 

The membership in T,\ can be described by the following formula (we again 
use that we only need to quantify over countable E-algebras): 3.4. A \= E. 
Hence, the property is in E}. □ 

Theorem 4. The following problem is Il\-hard, T\-hard and strictly contained 
in IS.\: 

Input: A bitstream specification E, a term s. 
Question: Has s unique solutions over all models of E? 
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Proof. The ±l}-hardness follows from the fact that the specification used in the 
proof of Theorem [2] always has a solution; then unique solvability coincides with 
at most one solution. 

The Ej-hardness is a consequence of the fact that the specification used in 
the proof of Theorem [3] always has at most one solution (due to the equation 
S = zeros); then unique solvability coincides with at least one solution. 

For the A^-membership we observe that a term s has a unique solution if 
and only if s has at least and s has at most one solution. Therefore unique 
solvability can be described by the conjunction of a n}- and a E}-formula. □ 



6.3 Equality in all Full Models 

In Section 16.21 we have considered models whose domain was any non-empty 
set of bitstreams (As Q {0,1}"). However, when writing equations such as 
even (a; : y : r) = x : even(r), the intended semantics is often that these equations 
should hold for all streams, that is, in full models with domain Ag — {0, 1}". We 
find that the restriction to full models results in a huge jump of the complexity, 
which then subsumes the entire analytical hierarchy. 

To prepare for the proof, we introduce some auxiliary specifications. We de- 
fine nat such that an equation natpQ = ones guarantees that the interpretation 
[X} represents a natural number in unary encoding, that is, [X] = 1™ W for 
n G N, as follows: 

nat(0 : 1 : a) = zeros nat(l : a) = nat(er) 

nat(0 : : a) = nat(0 : cr) nat(ones) = zeros 

Lemma 7. In every stream model A = (A, [■]) of a specification including the 
equations from (fT3"|) we have: if [nat](w) = 1" then w — 1" 0" for some n G N. 

Proof. If a stream is not of the format 1" 0" for some n G N then it is 1" or 

contains ... 01 The last equation rules out the case 1" (ensures that the 

interpretation is not l w ). 

The first three equations are exhaustive in the sense that every stream can 
be matched by one of them. The first equation rules out streams that contain a 
1 after a 0, and the equations two and three 'walk' step by step over the stream 
(proceed with the tail). □ 

We moreover define a function leq such that \eq(X,Y) = ones guarantees 
that pointwise [Xj < [Y] : 



leq(0 : cr, x : t) = \eq(a,r) 
leq(l : cr, 1 : r) = leq(cr, r) 
leq(l : a, : t) = zeros 




Lemma 8. In every stream model A = (A, |-|) of a specification including the 
equations from (| 14[) we have that if [leq](cr, t) = 1". then a is pointwise < than 
t (for all cr, r G As )■ 
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Lemmas [7] and [5] are valid for non-full models as well. As explained in the 
introduction, the assumption of full models is crucial to guarantee that equations 
with variables have to hold for all streams (assigned to the variables) and not 
only the streams in the model. 

Theorem 5. The following problem subsumes the analytical hierarchy: 
Input: Bitstream specification E, terms s,t :: S. 

Question: Are s and t equal in all full models of E? 

The idea of the proof is as follows. We translate formulas of the analytical hi- 
erarchy into stream specifications by representing V set quantifiers by equations 
with variables. This simulates a quantification over all streams as the models 
are full, and the equations have to hold for all assignments of the variables. 

The 3 set quantifiers are eliminated in favor of Skolcm functions /, that is, 
axioms of the form \fx.3y.ip(xi, . .. ,x n ,y) are replaced by \fx.ip(xi, x%, . . . , x n , f(xi, 
The interpretation of these functions is determined by the model, and the ques- 
tion whether there exists a model corresponds to an existential quantification 
over all Skolem functions. 

Proof. For every analytical set A, we reduce the membership problem in A to an 
equality problem. Every set A of the analytical hierarchy can be defined by 

a(£A (15) 
V£i. 3£ 2 - V£ 3 . . ..3£„. Vxi.3x 2 . M(£i,...,£„,a, x\,xi) 

where n £ N is even (without loss of generality since II* C n* +1 ) and M a 
decidable predicate. Let M = (Q,q ,5) the Turing machine corresponding to 
M. Let a £ N be given. We define E to be the following system of equations: 

S(ti,t 3 ,...,t„_i) = run(l, zip„(n, g 2 (n), r 3 , g 4 (n, r 3 ), 

■ • .)7V,_l,gn(Tl,T 3 , . . .,T n -i)), Zeros) 

S(ti,t 3 ,...,t„_i) = zeros 
run(0, t, 71) = ones 

run(l,T,7i) = : run(M(r; A,7ij M^Ti)), r, 1 : 71) 
A = (1 :) a zeros 
nat(hi2(r, 71)) = ones 

together with the equations from Em, ([9]), and (fl3|) . The symbols gm are typed 
y — > S. We claim: E \= zeros = ones if and only if a € A. For this purpose it 
suffices to show that the specification has a model (3.4. 4 |= E) if and only if 
the formula in the right-hand side of ([T5|) is valid. 

The idea is that the specification models a Skolem normal form of the an- 
alytical formula in (|15p. The V set quantifiers are modeled by an equation 
with stream variables; recall that equations have to hold for all assignments of 
the variables. In particular, the variables ti,t 3 , . . . ,r„_i in the first equation 
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S(ti,T3, • • • )T n _i) = . . . model the set quantifiers V£i, . . . ,V£ n _i, respectively. 
The 3 set quantifiers are modeled by Skolem functions <?2)<?4> ... ,g n which in 
the specification are stream functions that get the value of the preceding V 
quantifiers as arguments. These stream functions gn are unspecified and can 
be 'freely chosen' by the model A. Thus, the existential quantification over the 
Skolem functions corresponds to the existential quantification over all models 
in 3A. A\= E. 

The streams t\, 32 • ■ • , 7Vi-i> 9n{ T \i T 3i • ■ • 1 T n-i) that represent the values 
of the set quantifiers are then interleaved by zip n , and passed as the second 
argument, named r, to run; this argument serves as the left side of the tape for 
every invocation of the Turing machine M . 

The Vzi number quantifier is modeled by the third argument 71 of run. 
The initial value of 71 is zeros, and T : □' is prepended (corresponding to 
counting up) each time the Turing machine halts with output 1. The number 
quantifier is modeled by the Skolem function ri2 for which the equation 
nat(h2(r, 71)) = ones ensures by Lemma 0] that the interpretation [^(t, 71)] 
is a unary encoding of a natural number. Then the term M(r; A, 71, ^{t, 71)) 

With T = zip„(Tl,g2(Tl),T3,g 4 (Tl,T3), . . . (n, r 3 , . . . , t„_i)) corresponds 

precisely to M(£i, ...,£„, a, xi,x 2 ) in ([T5]) . 

For '<=', assume that the formula in (fT5j) is valid. We construct a model 
A = (A, [■]) as an extension of the canonical model (Definition ITS]) . For 
[52], [54], ■ • ■ , [^2] we pick the Skolem functions for the quantifiers 3^2, 3^4, . . . , 3£ n , 3x 2 , 
respectively (where [/12] is a stream function that works on the unary encod- 
ing of natural numbers). For a 6 {0,1}", we define [nat](er) = 1" if a is 
of the form 1™0", and 0", otherwise. The definition of [run] is analogous to 
the proof of Theorem [TJ Finally, wc define [S] (ti, T2, . . . , r„_i) — 0" for all 
n, r 2 , . . . , r n _i e {0,1}", and {A} = l a 0". Then it is straightforward to verify 
that A is a model of the specification. 

For '=>', let A — (A, [•]) be a model of the specification. Then we let the 
existential quantifiers 3£ 2 , 3^4, . . . , 3£„ and 3^2 in (fT5j) behave according to the 
interpretations [52], [54], ■ • ■ , [<7n]> [^2], respectively (here the translation from 
sets ^ C Nto streams ^ is as usual) . Assume that there exists an assignment of 
the V quantifiers V£i, V^2, • ■ • , V^„_i and Vx2 for which the formula in (fT5|) is not 
valid, that is, M(^x, • ■ • , Cm °> ^li ^2) does not hold where the existential choices 
are governed by the model as described above. We translate this 'counterex- 
ample' back to the model by considering [S](£i,£3 ■ ■ • ,£n-i)- As in the proof 
of Theorem [lj it is then straightforward to show that [S](£i,£3 ■ • • >£«-i) 7^ 0". 
However, this contradicts the assumption of A being a model due to the equation 
S(ti,t 3 , ... ,r n _i) = zeros. □ 

The proof of Theorem [5] immediately yields the following: 

Theorem 6. The following problem subsumes the analytical hierarchy: 
Input: Bitstream specification E, term s. 

Question: Does s have a solution over all full models of E? 



24 



Proof. Follows from the proof of Theorem O as zeros has a solution over all 
models of E if and only if E has a model. □ 

Theorem 7. The following problem subsumes the analytical hierarchy: 
Input: Bitstream specification E, term s. 

Question: Does s have a unique solution over all full models of E? 

Proof of Theorems^ and^ In the proof of Theorem zeros has a (unique) 
solution if and only if E has a model. □ 

For the proof of the following theorem, we slightly adapt the specification in 
the proof of Theorem [5] such that it always has a solution, and has more than 
one solution if and only if the analytical formula in (fT5j) holds. 

Theorem 8. The following problem subsumes the analytical hierarchy: 
Input: Bitstream specification E, term s. 

Question: Does s have at most one solution over all full models of E? 

Proof. We adapt the proof of Theorem [5] by exchanging the two equations 
S(. ..) = ... by the following one: 

ones= leq(S, is zeros (run(l, zip n (n, g 2 (ri), t 3 , g 4 (ri, r 3 ), 

■ ■ .,T n _i,gn(ri,r 3 , . . . ,7-7,-1)), zeros))) 

An interpretation [S] = 0" always yields a solution. In addition, by LemmaOwe 
have [S] ^ 0" only if [is zer os(. • •)] ^ 0" for every assignment of t%, T2, . . . , r„_i. 
But then [is zeros (. . •)] = 1" by LemmalU and, thus, [run(. . .)] = 0". As in the 
proof of Theorem [5J [run(. . .)] = W for all 71,72, . . . , r„_i if and only if the 
formula in (|T5j) holds. □ 

6.4 Equality of Solutions 

In this section, we study the complexity of deciding whether terms have the 
same set of solutions over all (full) models. It is easy to see that the hardness of 
these problems is at least that of deciding equality in all (full) models. When 
considering all models, the problem turns out IT^-complete, and, thus, higher 
than the degree 11} of equality in all models. 

Remark 3. Let us briefly discuss the applicability of equality in all (full) models 
for the comparison of terms s, t that are specified in independent specifications 
E s and E t . First, we rename the symbols of one of the specifications such that 
E s n Kj = {0,1,:}. Thereafter, we consider the validity of s = t in the union 
E s UE t . 

We show on two examples that this approach docs not always yield the 
intended results. Let Em consist of the single equation M = 1 : M, and En of 

N = inv(A^) inv(0 : a) = 1 : inv(cr) inv(l : a) = : inv(a) 
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Then M has the stream of ones as unique solution, but N has no solution. Since 
En does not have model, the union Em U En also does not admit one. Thus, 
Em U En \= M = N holds for trivial reasons. Nevertheless, we would not like 
to consider M and N as equivalent (at least if they are given by independent 
specifications). 

Even if the specifications have unique solutions, a similar effect can occur. 
Let M = zeros and Em consist of the equations 

iszeros(nxor(<r)) = zeros 

nxor(0 : : a) = 1 : nxor(cr) nxor(0 : 1 : a) = : nxor(er) 

nxor(l : : a) = : nxor(cr) nxor(l : 1 : a) = 1 : nxor(cr) 

together with the equations (fTU)) . Let N = blink and En consist of the equation 
blink = 0:1: blink. Both specifications have models, and zeros and blink have 
unique solutions. For example, Em admits a model whose domain consists of 
all eventually constant streams. However, Em rules out models for which there 
exist elements a £ As with [nxor](er) = 0". In particular, the stream 0101 . . . 
is excluded from the domain Ag. As a consequence, the union Em U En has no 
models, and Em U En \= zeros = blink holds. 

As a consequence of the proof of Theorem [5j we obtain: 

Theorem 9. The following problem subsumes the analytical hierarchy: 
Input: Bitstream specifications E s , E t , ground terms s,t:: S. 

Question: Do s andt have equal solutions over all full models, that is, [s} E ^ ull ■■ 

Proof. Let E s be the specification in the proof of Theorem El and s = zeros. 
Then {s} E full = {0"} if E s has a model, and otherwise. Let E t = {zeros' 
() : zeros'} and t' = zeros', then we have {t} E full = {0"}. Thus, [s} E full = 
[t} Et full is equivalent to E (= zeros = ones in the proof of Theorem [5] □ 

We conclude this section with an investigation of the complexity of deciding 
whether two terms have the same set of solutions over all models. The proof of 
Theorem [1] yields only n}-hardness. In order to show n^-hardness, we employ 
a result of [1] stating that it is a n^-complete problem to decide whether the 
w-language of a non-deterministic Turing machine contains all words {0, 1}^. 

Therefore, we consider non-deterministic Turing machines with one-sides 
tapes. Without loss of generality, we may restrict the non-determinism 5 : 
Q x T — > p(Q x F x {L, R}) to binary choices in each step, that is, \S(q, b)\ < 2 for 
every q € Q and b E {0, 1}. (Broader choices then are simulated by sequences 
of binary choices.) Moreover, for our purposes, it suffices to consider Turing 
machines that never halt. For the w-language, halting always corresponds to 
rejecting a run, and this rejection can be simulated by alternating moving forth 
and back eternally. 
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That is, a non-dctcrministic Turing machine M = {Q,qa,So,Si} has two 
transition functions 5q,8i : Q x T — > Q x V x {L,R} and we allow a non- 
deterministic choice between these functions in each step. Note that, for mod- 
eling non-determinism in an equational specifications, we cannot take the union 
of the specifications -E{Q. go ,5 ) and E(Q, qo ,Si)i since multiple equations having 
the same left-hand side do not model choice, but additional restrictions on the 
models of the specification. To this end, we introduce a third argument for the 
binary function symbols q £ Q in Definition [5] This argument then governs 
the non-deterministic choice. In order to model one-sided tapes, we introduce 
a fourth argument that stores the position on the tape, and is increased, when 
moving right, and decreased, when moving left. That is, we adapt Definition [3] 
to: 

q(x,b :y,i: z,p) = q'(b' : x,y,z,l : p) 
q(a : x, b : y, i : z, 1 : p) — q'(x, a : b' : y, z,p) 

for Si(q,b) — (q',b',R) and Si(q,b) — (q',b',L), respectively. We use E^ to 
denote this specification, and for the corresponding term rewriting system. 
In the initial configuration, the third argument should be an underspecified 
stream, allowing for any non-deterministic choice. We pass zeros as fourth 
argument, thereby ensuring that the head cannot move to negative tape indices. 

A run of M on an w-word w £ {0, 1}" is a R^ rewrite sequence starting from 
a term qn (zeros, w, N_, zeros) where N £ {0, 1}" determines the non-deterministic 
choices; here w is the term w(Q) : w(l) : . . . A run of M is complete if every tape 
position p > is visited (that is, positions right of the starting position), and it 
is oscillating if some tape position is visited infinitely often. A run is accepting 
if it is complete and not oscillating, that it, it visits every position p > at least 
once, but only finitely often. 

Definition 19. The w-language C U (M) is the set of all w-words w £ {0, 1}" 
such that M has an accepting run w. 

We employ the following result, which follows from [4]: 

Theorem 10. The set {M | £<"(M) = {0,1}"} is n\- complete. 

We are now ready for the proof of Ilj-completeness of equality of the set 
of solutions over all models. In the proof, we introduce a fifth argument for 
the symbol q £ Q in E^ which enforces progress (productivity) and rules out 
exactly the oscillating runs. 

Theorem 11. The following problem is Yi\-complete: 

Input: Bitstream specifications E s , E t , ground terms s,t :: S. 

Question: Do s and t have equal solutions over all models equal, that is, 
Mb. = Mb, ? 
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Proof. Let M = (Q, q , So, 5i) be a non-deterministic Turing machine. We reduce 
the problem in Theorem [TO] to a decision problem for the equality of the set of 
solutions over all full models. We let s = X and define the specification E s to 
consist of: 

go (zeros, X, N, zeros, P) — zeros (16) 
natstr(P) = ones (17) 

q(x, b:y,i: z,p, 1 : v) = q'(b' :x,y,z,l: p, v) (18) 
ioT5i(q,b) = {q',b',R) 
q(a : x, b : y, i : z, 1 : p, 1 : v) = q'(x, a:b':y, z,p, v) (19) 

for Si(q,b) = (q',b',L) 
q(x,y,z, 1 :p,0 : v) = : q(x,y,z,p,v) (20) 
q(x, y, z, : p, : v) = ones (21) 
q(a : x, b : y, i : z, : p, 1 : v) = ones (22) 

ioT8i{q,b) = {q',b',L) 

The equation (fTB"]) starts M on the stream X with non-deterministic choices 
governed by N and P for enforcing progress. The streams X and N are unspeci- 
fied, thus arbitrary. The equation (TT71) ensures that [P] contains infinitely many 
zeros. The equations (fT8)l and (fT9|) model the computation of M as discussed 
before, but now in each step removing the context 1 : □ from the fifth argument. 
If the fifth argument starts with a 0, then (|20|) decrements the position counter 
(the fourth argument). Recall, the position counter determines how many steps 
the Turing machine M is permitted to move left. Thus, always eventually decre- 
menting the counter rules out the oscillating runs. The equations (|2"Tj) and (|22p 
rule out models where the head move left of the envisaged progress [P] . 

It is important to note that for any non-oscillating run a, we can define a 
function p : N — >• N such that after p(n) steps, M visits only tape indices > n. 
Then an assignment [P] = l p(0) l p(2) ... in the model will permit this 
run to happen, that is, the head will never fall behind the envisaged progress 
and Equations (I2T1) and (|22|) do not apply. 

As a consequence, we have [sj E = {0,1}" if and only if for every [X] e 
{0,1}" there exists a non-oscillating run (that is, an appropriate choice [N]) 
of M on [X]. Now we define t = Y and E t = {Y = Y} for which obviously 
{t} Et = {0, 1}". Therefore, [s] Es = [t} Et if and only if £ W (M) = {0, 1}". This 
concludes the proof of n^-hardness. 

For n^-membership, the problem can be characterized by the following 
analytical formula: V(A„At). 3{A' s ,A' t ). (A s \= E s \= E t A {s\ A " = 

[t]"^) A (At h E t A' s h E s A m At = ls} A '°). As in the proof of Theorem [TJ 
here, it suffices to quantify over countable models. □ 
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7 Equality for Behavioral Specifications 



In this section we consider the notion of equality from |17j which is based on 
hidden algebras [16] . We introduce the hidden models of bitstream specifications 
as employed in [17] . where it has been shown that deciding the equality of 
(equationally defined) streams, with respect to this semantics, is a n^-complete 
problem. We consider the following two extensions of this semantics: 

(i) extending the semantics to streams over natural numbers, or 

(ii) requiring the behavioral equivalence = to be a congruence. 

We show that both extensions lift the complexity of deciding equality to the 
level 11} of the analytical hierarchy. If the specifications are required to be pro- 
ductive (thus, separating the problem of productivity |10) from that of equality) 
it can be shown that the complexity resides at Hi [T3]. The results in [T7] (as 
well as the results we mention in the current paper) are based on the comparison 
of non-productive specifications, and the proofs inherently encode productivity 
problems. 

Let us briefly explain why the 11} -completeness for the equality of bit- 
streams in Theorem Q] docs not directly carry over the setup of [17; . The 
problem is the definition of the function natstr in (TT2"|) containing the equa- 
tion natstr(ones) = zeros. This equation does not work if we have confusion 
in the models and behavioral equivalence is not a congruence. In particu- 
lar, as discussed in Section [2j if ones' = 1 : ones', wc cannot conclude that 
natstr(ones') = zeros. As a consequence, with the behavioral specifications 
of [T7] it is not possible to enforce that a bitstream always eventually con- 
tains a zero. However, if we consider behavioral specifications of streams of 
natural numbers, then we no longer need natstr, hence, reestablishing the In- 
completeness result for the equality of streams of natural numbers specified 
bchaviorally. There is a similar problem with the equation is zeros (zeros) = ones, 
that, however, can be overcome by discarding IS zer Q5 clS 111 the proof of Thcorem[3J 

7.1 Basic Setup 

In [17] , every bitstream specification contains the equations 

head(x : a) — x ta\\(x : cr) = a 

where head :: S — > B and tail :: S — > S. 

Definition 20. A hidden Yj-algebra A — (A. [■]} consists of 

(i) an <S-sorted domain A where Ab — {0, 1}, 

(ii) for every / :: s\ x . . . x s n — > s G £ an interpretation [/] : A Sx x. . . A Sn — > A s , 
(in) 0,1 S S with [0] = and [1] = 1. 

We stress that now ^4^ is an arbitrary set. 
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Definition 21. Let A = (A, [•]) be a hidden S-algebra. Then a, r 6 As are 
called behaviorally equivalent, denoted by a = t, if they are indistinguishable 
with {head, tail}-experiments, that is: 

a = t VneN. [head] ([tail] n (o) = [head]([tail]"(r) 

On the domain As, we let = be the identity relation. 

Note that = is a not a congruence (only for [head] and [tail]). 

Definition 22. Let E be a bitstream specification over S. A hidden S-algebra 
behaviorally satisfies E, denoted A ^ E, if for every equation of E, the left- 
and right-hand sides are behaviorally equivalent: 

\£} a = [rj a for every £ = r £ E and a : X ->• A 

We say that an equation ^ = r is behaviorally satisfied in all hidden models of 
E, denoted E ^ £ — r if A ^ E implies A ^ £ = r for every hidden S-algebra 
A. 

For a discussion of this semantics, we refer to Section [5J 

7.2 Behavioral Equivalence as Congruence 

We now adapt the basic setup by requiring = to be a congruence relation, that 
is, s = t implies /(..., s, .. .) = /(..., t, . . .). The resulting models are called 
behavioral in [2], 

Definition 23. A hidden E-algebra is called behavioral if = is a congruence 
relation. For a bitstream specification E over E, we say that I — r is behaviorally 
satisfied in all behavioral models of E if A ^ E => A ^ £ = r for every 
behavioral hidden E-algcbra A. 

Theorem 12. The following problem is Tl\-complete: 
Input: Bitstream specification E, terms s,t :: S. 

Question: Is s = t satisfied in all behavioral models of E? 

Proof. We show: the equation s = t is behaviorally satisfied in all behavioral 
models of E if and only if s = t holds in all models of E; the latter property is 
nj-complete by Theorem [TJ 

The direction '<=' follows immediately, since every S-algebra is a behavioral 
hidden S-algebra. For '=£>', let A = {A, [•]) be a hidden E-algebra. Let A/= = 
(A/=, [•]/=) be the quotient algebra. That is, A/= are the congruence classes 
of A with respect to =. For symbols / 6 S and Bi, . . . , i? a r(/) 6 A/=, we define 
[/l/=(Bi,...,fl or(/) ) = B if [/](&!,...,&«.(/)) = 6 for h e B u ...,b ar[f) e 
B ar U)' an d S is the congruence class of b with respect to =. The quotient 
algebra A/= is a behavioral hidden E-algebra that, due to = being a congruence, 
behaviorally satisfies the same equations as A. Let A' be the S-algebra obtained 
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from A/= by renaming the domain elements into the streams they represent, 
that is, a <E (A/=)s becomes [head] (a) : [head] ([tail] (a)) : . . .. Then [:](x,a) = 
x : <r, since in A/= every stream has a unique representative in the model. 
Hence, A' is a stream algebra. Moreover, for elements a, b of the domain of 
A/=, we have a = b iff a = b. Hence, A' is a model of an equation s = t if and 
only if s = t is behaviorally satisfied in A. □ 

7.3 Streams of Natural Numbers 

We briefly study hidden models with confusion, described in Section [2j for 
streams of natural numbers. A N-stream specification is now defined like a 
bitstream specification, except the sorts are S = {N, S}, and the symbols are 
:: N, s :: TV ->• N and ':' of type N x S -> S. We adapt the definition of 
hidden E-algebras accordingly. 

Definition 24. A hidden 'E-algebra A = (A, [•]} consists of 

(i) an <S-sorted domain A and = N, 

(ii) for every / :: s\ x . . . x s n — > s € E an interpretation [/] : A Sl x . . . A Sn — > A s , 

(iii) 0, s € E with [0] = and [s](x) = a; + 1, 

(iv) for every s € Ag there are n € N and s' £ such that we have s = [:] (6, s'); 
sec further Remark ??. 

The definitions of behavioral equivalence and satisfaction are the same as 
for bitstream specifications. A slight modification of the proof of Theorem [3] 
results in the following. 

Theorem 13. The following problem is Ti\-complete: 
Input: N-stream specification E, terms s,t :: S. 

Question: Does E ^ s = t hold? That is, is s = t behaviorally satisfied in all 
hidden models of E? 

Proof. We reduce the well-foundcdness problem for decidable binary relations to 
an equality problem. Let M C NxN be a decidable predicate, and M = (Q, qo, 6) 
the corresponding Turing machine. We define the following specification E: 

zeros = run(l, X) unary(O) = zeros 
run(0, fi) = ones unary(s(x)) = 1 : unary(a;) 
run(l, a) = : run(M(zeros; unary(head(cr)), 

u nary (head (tail(cr)))), tail (a)) 

together with the equations from Em and In contrast with the proof of 
Theorem [3l X is now a stream of natural numbers. Since X is unspecified, its 
interpretation in the model can be an arbitrary stream of natural numbers. As 
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in the proofs of Theorems [JJ and [3J we employ X to guess an infinite path through 
M. Instead of uhd(-) and utl(-) on bitstreams, we now take unary(head(-)) and 
tail(-), respectively, where the function unary converts natural numbers to unary 
representations in forms of streams. As in the proof of Theorem [31 it follows 
that there exists a hidden E-algebra A with A ^ E if and only if M is not 
well-founded. Thus, E ^ zeros = ones if and only if M is well-founded. □ 

8 Equivalence of Lambda Terms 

In this section we investigate the complexity of deciding the equality of A- terms 
with respect to the observational equivalences =„/, —hnf and = w fmf as intro- 
duced in Section [TJ Furthermore, we study the complexity of deciding whether 
two A-terms have the same Bohm trees or Levy-Longo trees. The interested 
reader is referred to [TJ [7] for an introduction to Bohm trees, and to [5] for a 
thorough study of the observational equivalences on A-terms. 

Definition 25. Let M be a A-term. The Bohm tree BT(M) ofM is a potentially 
infinite term defined as follows. If M has no hnf, then BT(M ) = _L. Otherwise, 
there is a head reduction M -^* h Xx±. . . . Xx n .yM\ . . . M m to head normal form. 
Then we define BT(M) = Xx x . . . . Air„.yBT(Mi) . . . BT(M m ). 

Definition 26. Let M be a A-term. The Levy-Longo tree LT(M) of M is a 
potentially infinite term defined as follows: 

LT(M) = 1 if M has no whnf 

LT(M) = \x.U(N) if M Xx.N 

LT(M) = xLT(Mi) . . . LT(M m ) if M -+* h xM 1 ...M m 

For the observational equivalences we obtain: 

Theorem 14. For each =■? G {= n , =fc, — w }, the following problem is Un- 
complete: 

Input: X-terms M, N. 

Question: Does M =? N hold? 

Proof. First, we show n^-membership of the problem. We consider = n and 
= w work analogously). A A-term Q has a normal form if and only if Q admits a 
standard reduction — >-* td to a normal form, see [lj. For a A-term Q, and n G N, 
we write Q — >j t ^ nf to denote that Q rewrites to a normal form within < n 
steps of standard reduction. Note that this is a decidable property. Then we 
claim: 

M =„ N ^ (23) 
VC. Vn. 3m. [c[M] nf & C[N] nf) 

For in ([23]) . assume that M = n N. Let C be a context. We distinguish the 
following cases: 
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(i) Assume that C[M] has a normal form. Then C[N] has one, and C[M] — >^ td 
nf and C[N] — >-j t( 2 nf for some £ N. Then in (|23[) for any n £ N we can 
choose m = max(fc,€). 

(ii) The case that C[N] has a normal form is symmetric to (i) 

(iii) If neither C[M] nor C[N] have a normal form, then neither C[M] ->^ +m u/ 
nor C[N] — >f t ^ +m nf for any n, m £ N. 

For '<=' in (|23p . assume M ^„ iV. Then there is a context C such that exactly 
one of the terms C[M] and C[N] has a normal form; without loss of generality, 
assume C[M] n / for somc 71 6 N - Hence, C[M] -^ +m u/ for every 

m £ N, but C[iV] ^^ +m n/ for no m € N. Thus, the right-hand side of ([12]) 
is not satisfied. 

From (P2B1 it follows that =„ is in nS], since the two quantifiers VC and Vn 
can be merged into a single V- quantifier. 

We now proceed with proving iLj-hardness of the problem. Let T be a Turing 
machine, and let T be a A-term such that for all n, m £ N, Trim rewrites to K if 
T terminates on input n within m steps, and to Kl, otherwise. Here, K = Xxy.x 
and I = Ax. a; are the usual combinators, and k = Xf.\x.f n x is the Church 
numeral representing the natural number k £ N. The construction of such T is 
standard, see [T]. Now we define: 

M = (\x.Xa.a(xx))(Xx.Xa.a(xx)) 

N = N'N'zer N' = Xxn.T'n zer(Xa.a(xx(succ n))) 
T = y"T" T = Xxnm.Tnm\(xxn(5uccm)) 

zer = Xfx.x succ = Xzfx.f(zfx) 

We show that M =? N if and only if T halts on all n £ N. Note that T'n m 
I if Tnm — >* K, that is, if T terminates on input n in m steps; otherwise 
T'nm — >* T'n ( m + 1 ). Hence, we obtain 

T'nO — >* I <S=^ T halts on input n 

T'nO has a (weak) head normal form 

The Levy-Longo tree of M is Xa.a(Xa.a(Xa.a . . .)). If T halts on input n, we 
have 

N'N'n^* T'nzer(Xa.a(N'N' n+l )) Xa.a(N'N' n±l ) 

Thus if T terminates on all n £ N, then the Levy-Longo trees of M and N are 
equal, and, hence, by [6] we have M — w N, M =/, N and M = n N. Otherwise, 
let n £ N be minimal such that T does not halt on n. Then by the above, we 
have: 

N H>* Xa. a(Xa. a{. . . Xa. a(N'N'n) . . .)) 

s / 

n-timcs 

Then N\ n ->* N'N'n has no (weak) head normal form, but Ml™ has. Thus we 
have M ^„ AT, M ^ h N and M ^ w N. This proves n^-hardness. □ 
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The proof immediately yields the following result: 

Theorem 15. The following problems are Tl^-complete: 
Input: X-terms M, N. 

Question: (i) Do s and t have equal Bohm trees? 

(ii) Do s and t have equal Levy-Longo trees? 

Proof. Follows immediately from the proof of Theorem [14] since M and N are 
observationally equal if and only if they have the same Levy-Longo tree, and 
for M and N the Levy-Longo trees coincide with their Bohm trees. □ 

We mention that for Berarducci trees, the proof of Theorem [T4l implies n!]- 
hardness. It is not difficult to see that the problem of deciding the equality of 
Berarducci trees is in LT3 . We leave the determination of the precise complexity 
to future work. 

9 Conclusions 

We have investigated different model-theoretic and rewriting based semantics 
of equality of infinite objects, specified either by systems of equations or by 
A-terms. It turns out that the complexities for these notions vary from the low 
levels of the arithmetical hierarchy and Ilf], up to LT} and of the ana- 
lytical hierarchy, and some even subsume the entire arithmetical and analytical 
hierarchy. In particular, the observational equivalences of A-terms, that are of 
interest for functional programming, are all n^-complete. 

Apart from n", none of these classes are recursively enumerable or co- 
recursively enumerable. Thus, there exists no complete proof systems for prov- 
ing or for disproving equality. An exception is the equality of normal forms 
for productive specifications for which inequalities can be recursively enumer- 
ated [15] . 
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